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For a broad range of single-seed contagion processes acting on generalized random networks, we 
derive a unifying analytic expression for the possibility of global spreading events in a straightfor- 
ward, physically intuitive fashion. Our reasoning lays bare a direct mechanical understanding of an 
archetypal spreading phenomena that is not evident in circuitous extant mathematical approaches. 
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I. INTRODUCTION 



Spreading is a universal phenomenon occurring in 
many disparate systems across all scales, as exemplified 
by diffusion and wave propagation, nuclear chain reac- 
tions, the dynamics of infectious biological diseases and 
computer viruses, and the social transmission of religious 
and political beliefs. Many spreading processes take place 
on networks, or leave a branching network of altered 
entities in their wake, and over the last decade, stud- 
ies of contagion on random networks in particular have 
provided fundamental insights through analytic results 
for abstract models P, Ql- Furthermore, in acknowledg- 
ing the governing roles of the degree distribution Q and 
correlations between nodes generalized random net- 
works Q have been profitably employed in modeling real- 
world networks Thus, a clear, physical understand- 
ing of the dynamics of contagion processes on generalized 
random networks provides a crucial analytic cornerstone 
for the goal of understanding spreading on real- world net- 
works. 

Here, we obtain a unifying analytic expression for the 
possibility of a global spreading event — which we define 
as the infection of a non-zero fraction of an infinite 
network — for a broad range of contagion processes act- 
ing on generalized random networks and starting from a 
single infected seed. We provide both a general frame- 
work and results for a series of specific random network 
families, allowing us to reinterpret, integrate, and illu- 
minate previously obtained conditions. Our explanation 
has obvious pedagogic benefits: While results for these 
families arc known, previous treatments have centered 
around powerful but non-intuitive and indirect mathe- 
matical approaches, typically involving probability gen- 
erating functions 0, |j, @ • We show that a global spread- 
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ing (or cascade) condition can in fact be transparent- 
ly derived by considering local growth rates of infection 
only, such that physical contagion processes are manifest 
in our expressions. 

Our derivation readily accommodates networks with 
an arbitrary mixture of directed and undirected weight- 
ed edges, node and edge characteristics, and node-node 
correlations, and can be extended to other kinds of ran- 
dom networks such as bipartite affiliation graphs . Our 
argument also applies to contagion processes evolving in 
continuous or discrete time, and for the latter case, with 
either synchronous or asynchronous updates. Nodes may 
also recover or stay infected as the outbreak spreads. 

In what follows, we first obtain an inherently physical 
condition for the possibility of spreading on generalized 
random networks, and then provide specific treatments 
for six interrelated classes of random networks. 



II. PHYSICALLY-MOTIVATED DERIVATION 
OF A GENERAL SPREADING CONDITION 

Our goal is to intuitively derive a test for the pos- 
sibility of global spreading from a single seed, given a 
specific random network and contagion process [8|. To 
do so, we construct a global spreading condition based 
on the infection counts of edge-node pairs rather than 
just nodes. While considering how the number of infect- 
ed nodes grows is a more obvious and natural framing, 
and one that has been broadly employed (e.g., the repro- 
duction number in mathematical epidemiology [^), the 
growth of 'infected edges' emanating from infected nodes 
is equally transparent, and opens a door to analytic treat- 
ment. 

Since generalized random networks, correlated or not, 
are locally branching networks 3, successful spreading 
from a single seed must entail nodes becoming infected 
in response to a single neighbor's infection (such nodes 
have been termed 'vulnerable' [l^). For any given conta- 
gion process, we therefore need only examine the trans- 
mission of infection along single edges. Furthermore, suc- 
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FIG. 1: (Color online) Schematic showing an infection poten- 
tially spreading from node-edge pair a' = {v' , A') to node-edge 
pair a = [v, A). 

ccssful spreading leads to exponential growth on random 
networks when one infected edge, on average, generates 
more than one new infected edge. 

In Fig.[Tl we provide a schematic of the spread of a con- 
tagious element through a random network. We frame 
our analysis around the probability that an edge of type 
A' 'infects an edge' of type A through a node of type z/, 
where by type, we mean individual characteristics such 
as node or edge age, node degree, edge direction, edge 
weight, hidden variables, etc. As shown in Fig. [TJ a i^' 
node is already infected due to a A" edge and is conse- 
quently signalling its infection to its neighbors. In par- 
ticular, the A' edge communicates the infection of the v' 
node to the v node and thereby potentially to the marked 
A edge. For an infection to spread, we must account for 
all possible edge-edge transitions incorporating the prob- 
ability of their occurrence based on (1) network struc- 
ture and (2) the nature of the spreading process. Our 
framing leads us to identify node-edge pairs as the key 
analytic components, as indicated in Fig.[l] and we write 
d = {v, A) and a' = {v' , A'). 

We first consider contagion processes with discrete 
time updates and one-shot infection chances. By one- 
shot, we mean that once a node becomes infected, it has 
one time step to infect its neighbors (excluding the node 
which infected it), after which no infection can be trans- 
mitted. We argue that the growth of the expected num- 
ber of type a node-edge pairs first infected at time t, 
fs.{t), follows an exponential growth equation: 

/5(t + l) =^i?55'/5'W, (1) 

a.' 

where Rssi is what we will call the 'gain ratio matrix', 
and which possesses a three-part form: 

The first term Pss' represents the conditional probabil- 
ity that a type A' edge emanating from a type v' node 
leads to a type v node. The middle element kssi is the 



number of type A edges emanating from nodes of type v, 
excluding the incident type A' edge arriving from a type 
I/' node. The last term B^s' represents the probability 
that a type v node is infected by a single infected type 
A' link arriving from a neighboring node of type v' (the 
potential recovery of the infected v' type node is incor- 
porated in Bgs')- The first and second elements encode 
the network's structure, while the third represents the 
spreading phenomenon, and each term's dependence on 
d and d' may be none, part, or whole. In Eq. ^ and 
below, we use the symbol '•' to make clear the composi- 
tion of the three pieces of the gain ratio matrix. 

We can now state the global spreading condition for 
spreading from a single seed on arbitrarily correlated 
random networks with discrete time update: the largest 
eigenvalue of the gain ratio matrix R = [i?Q5'] must 
exceed unity, i.e., 

sup{|/^| : ^ 6 cr(R)} > 1 (3) 

where cr(-) indicates eigenvalue spectrum. 

Next, we can easily accommodate other types of conta- 
gion processes by computing the number of nodes infect- 
ed a distance d away from the seed rather than as a 
function of time. The infection probability B^s' is then 
computed over all time and is interpreted as the prob- 
ability that a node of type v is eventually infected by 
edge A'. We now more generally write fs{d + 1) = 
X^s' Rsa' fs'{d), with Bss'^s role altered and we see that 
the same global spreading condition arises. Therefore, 
Eq. ([3]) applies for contagion processes for which time is 
continuous or discrete, where nodes may recover, etc., all 
providing we can sensibly compute B^s' [HI ■ 

III. APPLICATION TO UNDIRECTED, 
DIRECTED, AND MIXED RANDOM 
NETWORKS 

We now apply our argument to six interrelated classes 
of random networks, connecting to existing results in the 
literature. We consider networks with arbitrary degree 
distributions, mixtures of undirected and directed edges, 
and node-node correlations based on node degree. Our 
general global spreading condition takes on specific forms 
for these networks which are worth deriving individually. 
We summarize the resulting global spreading conditions 
in Tab. H 

We generally follow the approach of Boguha and Serra- 
no 0, who provided a formulation for degree-correlated 
random networks with mixed undirected and directed 
edges. We represent nodes by a degree vector k = 
[fcu h kof^ where the entries are, respectively, the num- 
ber of undirected (or bidirectional) edges between a node 
and its neighboring nodes; the number of directed edges 
leading in to a node; and the number of directed edges 
leading away from a node. For random networks, the 
explicit inclusion of undirected edges is necessary for 
modeling instances of mutual influence between nodes. 
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Network: 


Local Growth Equation: 


Gain Ratio Matrix: 


1. Undirected, 
Uncorrelated 


fid+l)=Rf{d) 


feu 


P'"'(fcu|*)»(fcu-l)»Bfe„,* 




II. Directed, 
Uncorrelated 


f{d+l)=Rf{d) 


fei.fco 


P«(fci,fco|*)»fco.Bfc„. 




III. Mixed 
Directed and 
Undirected, 
Uncorrelated 




7(")(d+l)- 




i^=E 


■ P("^(fcl *) • (fcu - 1) P(')(fc| *) .fcu ■ 


• Bk^ki,* 


IV. Undirected, 
Correlated 




Rk^k'^ = 


p(")(fc,|fc;).(fc,-i).Bfc„fe, 




V. Directed, 
Correlated 




Rk-^kak[k'^fk[k'^{d) 


Rk-^kak[k'^ 


= P'''\h, ko 1 fci', k'o) • fco • Bfeife„fe|fe^ 




VI. IVlixed 
Directed and 
Undirected, 
Correlated 




jf{d+l)_ 




R-fefc' = 


P("'(fc|fc')»(fcu-1) P'''(^|fc')«fcu 


• Sfe^fe^' 



TABLE I: Summary of local growth equations and ratios for six classes of random networks and general contagion processes. 
These equations describe the expected early growth in infection counts, represented by /, starting from a single initial infective 
(or seed). Each gain ratio is written so as to highlight three distinct factors in the following order: (1) the probability of an edge 
leading to a node of a specific type; (2) the number of infected edges arising from a successful infection; (3) the probability of 
successful infection of that node. As the forms show and as discussed in the main text, these three factors depend on the nature 
of the edge potentially transmitting an infection. The gain ratio is a scalar for classes I and II, and a matrix for classes III- VI. 
When the gain ratio is a scalar, the contagion condition is simply P > 1, while for the matrix cases, at least one eigenvalue 
must exceed 1. Classes I-V are special cases of class VI. 



and analytically affords a way of connecting directed net- 
works with undirected ones. 

We write the probability that a randomly selected 
node has degree vector k as Pj:. We represent corre- 
lations between nodes via three transition probabilities: 
P(^)(fc|fc'), P'^'Hk\k'), andP(°)(fc|fc'), which are the prob- 
abilities of an undirected, incoming, or outgoing edge 
leading from a vector degree k' node to a vector degree 
k node. The superscripts therefore refer to the degree k 
node (these conditional probabilities are defined similarly 
to those used in Q , but with the directed cases reversed) . 

As we have argued in general, in finding the global 
spreading condition for random networks, we have to 
determine three quantities: (1) the probability that a 
type A' edge emanating from an infected type v' node 
leads to a type v node where we may have to condition 
on a' and a; (2) in the case of successful infection, the 
resultant number of newly infected outgoing A type edges 
emanating from the type v node; and (3) the probability 
that the type i> node becomes infected. 

We start with the basic case of undirected, uncorrelat- 
ed random networks with a prescribed degree distribution 
Pk^ (class I). The first of the three quantities is given by 



the observation that following a randomly chosen edge 
leads to a degree k^ node with probability k^P^^/ (k^) [7[, 
which we will write as P'^")(fcii | *) with the indicating 
an absence of correlations. Second, if a degree k^ node is 
infected, k^ — l new edges will be infected. And third, we 
have that a degree k^ node becomes infected with proba- 
bility -Bfey,*. Putting these pieces together and summing 
over all possible values of k^ (since the network is uncor- 
related), we arrive at the well known global spreading 
condition for random networks: 

oo 

^ = E I *) • - 1) • ^fe-* > 1- (4) 

fcu=0 

The local growth equation is simple: f{d -f 1) = Rf{d). 
In the case that we set -Ba;„.* = 1, meaning the con- 
tagion process is always successful, we have the condi- 
tion for the presence of a giant component, which was 
obtained by MoUoy and Reed Q in the alternate form 
Efe!=o^u(^u - 2)-Pfe„ > 0. Although Molloy and Reed 
suggested some intuition for this particular form, we 
believe the kind of derivation we have provided here 
is the clearest, most direct formulation. Later, New- 
man et al. Q arrived at the same result using gener- 
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ating functions, specifically by examining when the aver- 
age size of finite components diverged for a family of 
parametrized random networks, and Watts [loj . using 
the same techniques, obtained Eq. ^ for a random net- 
work version of Granovetter's threshold-based model of 
social contagion [l2|. These arguments, while entirely 
effective and part of a larger exploration of the details of 
random networks (uncovering, for example, distributions 
of component sizes), are somewhat opaque and round- 
about. Thus, while we could readily rearrange Eq. 
and our other results below to generate more mathemat- 
ically clean statements, an essential degree of physical 
intuition would be lost. 

In moving to purely directed networks (class II), we 
now allow each node to have some number of incom- 
ing and outgoing edges, ki and k^. The three pieces 
of the gain ratio R are now: (1) upon choosing a ran- 
dom (directed) edge, the probability the edge leads to 
a node with degree vector [ki,ko]'^ is P^'^ (fci, fco | *) = 
kiPkika/ (h)', (2) the consequent number of infected out- 
going edges is simply fco! and (3) the probability of infect- 
ing such a node is Bk^.*- The global spreading condition 
for uncorrelated directed networks is therefore 



R= ^P«(fci,fco 

fci.fco 



• fco • Sfci,* > 1, 



(5) 



and the local growth equation is again f{d+l) = Rf{d). 
The condition for the existence of a giant component, 
found by setting „ = 1 as before, was obtained by 
Newman et al. [7| , again by determining when the aver- 
age size of finite components diverges. Newman et al.^s 
version of the condition is an elegant algebraic rearrange- 
ment of Eq. ([S]) as (2fcofci — fco — fci) = 0; Boguha and Ser- 
rano simplified Eq. ® further to (fco(fci-l)) = smce 
(fco) = (fci). Again, the physics of the process is entirely 
obscured by these mathematically clean statements. 

We next consider random uncorrelated networks with 
arbitrary mixtures of directed and undirected edges (class 
III). As shown in Tab. U the local growth equation now 
accounts for the expected numbers of undirected and 
directed edges a distance d from the seed, /'■"'' (rf) and 
f^°^{d) (outgoing rather than incoming edges are record- 
ed since we have framed our analysis around infected 
edges leaving infected nodes). In computing the expect- 
ed values of /(")(d + 1) and f^°^d + 1), we see the gain 
ratio is a 2 X 2 matrix built around four possible edge-edge 
transitions: undirected to undirected, undirected to out- 
going, incoming to undirected, and incoming to outgoing. 
The corresponding components of the gain ratio matrix 
are P(")(fc | *) . (fcu - 1), P(")(fc I *) • P«(^l*)»fcu, 
and P^')(fc I *) • fco- For all four transitions, the proba- 



bility of infection is Bk^ki,*- Summing over all possible 
degrees, we find global spreading occurs when the largest 
eigenvalue of the gain rate matrix 



R 



E 

k 



P(")(fc 
P(")(fc| 



*) • (fcu - 1) 
*) • fco 



pW(fc 

p«(fcl 



• fcu 

• fcn 



(6) 

exceeds unity. The global spreading conditions for pure 
undirected and directed networks, Eqs. and ([5]) can be 
retrieved by setting either P'")(fc | *) and fcu or P'^''(fc | *) 
and fco equal to zero. 

The above three classes of uncorrelated random net- 
works (I: undirected, II: directed. III: mixed) have nat- 
ural degree-degree correlated versions (IV, V, VI). The 
derivation of their respective global spreading conditions 
follows the same argument with two changes. First, aver- 
aging over node degrees can no longer be done and the 
gain ratio matrix now has entries for each possible tran- 
sition between edge types. Second, all transition proba- 
bilities are now properly conditional, e.g., P^")(fcu | *) is 
replaced with p("^(fcu | fc(j) for pure undirected random 
networks. Consequently, the gain ratio matrix is a func- 
tion of the degrees fc' and fc. The resultant gain ratio 
matrices and the expanded growth equations agree with 
expressions obtained by Bogufia and Serrano [3 , and are 
shown in Tab. |T] 



IV. CONCLUDING REMARKS 

In summary, we have shown that the possibility of 
global spreading for contagion processes on generalized 
random networks can be obtained in a direct, physical- 
ly motivated fashion. A similar kind of clear approach 
should apply for finding the probability of global spread- 
ing. Our work naturally complements that of Gleeson 
and Cahalane [l^ who solved the fundamental problem 
of the final size of an outbreak, in a similarly straight- 
forward way for macroscopic seeds and, in the limit, for 
isolated seeds as well. Obtaining an exact solution for the 
time evolution of spreading from a single seed remains the 
last major challenge for these random network models. 
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